Building a Knowledge Base using Microblogs: the Case of Cultural MicroBlog Contextualization Collection

Authors

  • Hoang Thi Bich Ngoc
  • Josiane Mothe
Abstract

The Cultural MicroBlog Contextualization (CMC) Workshop provides a collection of tweets on cultural events related to festivals. Because a tweet is short, a single post usually conveys only partial information. We develop the idea that combining the information posted in a set of tweets about an event can give a more complete view of that event. In this paper, we propose a model for representing a collection of microblogs as a knowledge base. Using the set of tweets on festival events from CMC, we define a domain ontology and show how to populate it from both the tweet collection and external data. We detail how the knowledge base could be used to provide a complete view of an event. This paper presents preliminary results.
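
As a rough illustration of the idea of combining partial information from several tweets, the following minimal Python sketch merges per-tweet facts into a single record per event. It is not the authors' model or ontology: the function name `merge_event_facts`, the `event`/`facts` dictionary layout, and all sample values are hypothetical, and the sketch assumes the facts have already been extracted from the tweet text.

```python
from collections import defaultdict


def merge_event_facts(tweets):
    """Merge partial facts from many tweets into one record per event.

    Each tweet is assumed (hypothetically) to be a dict with an "event"
    name and a "facts" dict mapping a property to a single value.
    """
    events = defaultdict(lambda: defaultdict(set))
    for tweet in tweets:
        for prop, value in tweet["facts"].items():
            # A set keeps each distinct value once, even if many tweets repeat it.
            events[tweet["event"]][prop].add(value)
    # Convert to plain dicts with sorted lists for stable, readable output.
    return {
        event: {prop: sorted(values) for prop, values in props.items()}
        for event, props in events.items()
    }


# Hypothetical tweets: each one carries only part of the picture.
tweets = [
    {"event": "ExampleFest", "facts": {"location": "Example City"}},
    {"event": "ExampleFest", "facts": {"date": "2016-07-01"}},
    {"event": "ExampleFest",
     "facts": {"performer": "Example Band", "location": "Example City"}},
]

knowledge = merge_event_facts(tweets)
print(knowledge["ExampleFest"])
```

No single sample tweet mentions the location, date, and performer together, but the merged record does, which is the intuition behind building a knowledge base from the whole collection.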

Similar resources

LIG at CLEF 2016 Cultural Microblog Contextualization: TimeLine illustration based on Microblogs

This paper presents the approach used by the LIG-MRIM research group for its participation in task 3 (TimeLine illustration based on Microblogs) of the CLEF Cultural Microblog Contextualization track. This task deals with the retrieval of tweets related to cultural events (music festivals). For the content-based elements, we use the classical BM25 model [4]. Then, we diversify the resul...

Building a Knowledge Base Using Microblogs: the Case of Festivals and Location-Based Events

Social media like Twitter are used during an event (catastrophe, cultural events ...) to collaboratively comment or advise on that event. Social network users are then notified through the people they follow or by seeking tweets related to the event. However, given the size of a tweet, the information obtained by a single post is often very partial. Using a set of tweets about an event makes it...

Microblog Contextualization using Continuous Space Vectors: Multi-Sentence Compression of Cultural Documents

In this paper we describe our work for the MC2 CLEF 2017 lab. We participated in the content analysis task that involves filtering, language recognition and summarization. We combine Information Retrieval with Multi-Sentence Compression methods to contextualize microblogs using Wikipedia’s pages.

Improving Microblog Retrieval from Exterior Corpus by Automatically Constructing a Microblogging Corpus

A large-scale training corpus consisting of microblogs belonging to a desired category is important for high-accuracy microblog retrieval. Obtaining such a large-scale microblogging corpus manually is very time- and labor-consuming. Therefore, some models for the automatic retrieval of microblogs from an exterior corpus have been proposed. However, these approaches may fail in considering microblog...

Improving Microblog Retrieval from Exterior Corpus by Automatically Constructing Microblogging Corpus

A large-scale training corpus consisting of microblogs belonging to a desired category is important for high-accuracy microblog retrieval. Obtaining such a large-scale microblogging corpus manually is very time- and labor-consuming. Therefore, some models for the automatic retrieval of microblogs from an exterior corpus have been proposed. However, these approaches may fail in considering microblog...

Publication date: 2016